Learning to Race: Experiments with a Simulated Race Car
نویسندگان
چکیده
Experiments with a Simulated Race Car Larry D. Pyeatt Adele E. Howe Colorado State University Fort Collins, CO 80523 email: fpyeatt,[email protected] URL: http://www.cs.colostate.edu/~fpyeatt,howeg Abstract We have implemented a reinforcement learning architecture as the reactive component of a two layer control system for a simulated race car. We have found that separating the layers has expedited gradually improving competition and multagent interaction. We ran experiments to test the tuning, decomposition and coordination of the low level behaviors. We then extended our control system to allow passing of other cars and tested its ability to avoid collisions. The best design used reinforcement learning with separate networks for each behavior, coarse coded input and a simple rule based coordination mechanism. Introduction Autonomous agents require a mix of behaviors, i.e., responses to di erent stimuli. This is especially true in situations where there are other agents present or where the environment is otherwise nondeterministic. For an agent to be e ective in its environment, it must have a large repertoire of behaviors and must be able to coordinate the use of those behaviors e ectively. Reactive systems have been favored for applications requiring quick responses, such as robotic or process control applications(Arkin 1994). Unfortunately, it is di cult to tune, decompose and coordinate reactive behaviors while ensuring consistency. In this paper, we present a case study of designing a mostly reactive autonomous agent for learning to perform a task in a multi-agent environment. The agent controls a race car in the Robot Automobile Racing Simulator (RARS) system (Timin 1995). To be successful, the agent must quickly respond to its environment and must have mastered several skills: steering, accelerating, and passing. To address these requirements, our basic agent architecture (Figure 1) adopts the common two-layer control structure: low level behaviors (implemented Copyright c 1998, American Association for Articial Intelligence (www.aaai.org). All rights reserved. Partial State Vector Sensors Effectors Actions Environment
منابع مشابه
Comparison of the maintenance of root canal anatomy in severely curved resin blocks using RaCe and Flexmaster rotary systems
Comparison of the maintenance of root canal anatomy in severely curved resin blocks using RaCe and Flexmaster rotary systems Dr. S. Khedmat* - Dr. M. Vahdatifar** *- Associate Professor of Endodontics Dept. - Faculty of Dentistry – Tehran University of Medical Sciences. **- Endodontist. Background and Aim: Various rotary systems are available on the market, therefore selection of a proper instr...
متن کاملLearning Coordinated Behaviors for Control of a Simulated Race Car
category: Control, navigation and planning This work is not in submission or press to any other conference. Abstract The demands of rapid response and the complexity of many environments make it diicult to decompose, tune and coordinate re-active behaviors while ensuring consistency. We hypothesize that complex behaviors should be decomposed into separate behaviors resident in separate networks...
متن کاملReinforcement Learning with Decision Trees
We present a decision tree based approach to function approximation in reinforcement learning. We compare our approach with table lookup and a neural network function approximator on three problems: the well known mountain car and pole balance problems as well as a simulated automobile race car. We find that the decision tree can provide better learning performance than the neural network funct...
متن کاملمقایسه تأثیر سیستمهای چرخشی Profile و Race بر آناتومی اولیه کانالهای مشابهسازی شده دارای انحنای شدید
Background and Aim: Recent advances in technology have permitted manufacturers to produce endodontic files from NiTi alloy and rotary systems. According to manufacturers, these files maintain original canal shape better than hand files and reduce treatment time. The aim of this study was to compare a new rotary system (Race) with an older one (Profile) in maintaining the original shape of curve...
متن کاملDecision Tree Function Approximation in Reinforcement Learning
We present a decision tree based approach to function approximation in reinforcement learning. We compare our approach with table lookup and a neural network function approximator on three problems: the well known mountain car and pole balance problems as well as a simulated automobile race car. We find that the decision tree can provide better learning performance than the neural network funct...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998